Towards data-intensive testing of a broad-coverage LFG grammar
نویسنده
چکیده
This paper addresses the problem that manual checking of output representations becomes impracticable in extensive tests during grammar development or in data-intensive applications of the grammar, like grammar-based lexicon acquisition from corpora. A method of annotating the sentences to be parsed with target expressions is proposed, using the LFG formalism itself to specify the expressions, such that the check of the actual solutions against the target speci cation is performed by the standard LFG constraint solver.
منابع مشابه
Towards data - intensive testing of abroad - coverage LFG grammar Jonas
This paper addresses the problem that manual checking of output representations becomes impracticable in extensive tests during grammar development or in data-intensive applications of the grammar, like grammar-based lexicon acquisition from corpora. A method of annotating the sentences to be parsed with target expressions is proposed, using the LFG formalism itself to specify the expressions, ...
متن کاملA Comparison of Evaluation Metrics for a Broad-Coverage Stochastic Parser
This paper reports on the use of two distinct evaluation metrics for assessing a stochastic parsing model consisting of a broad-coverage Lexical-Functional Grammar (LFG), an efficient constraint-based parser and a stochastic disambiguation model. The first evaluation metric measures matches of predicate-argument relations in LFG f-structures (henceforth the LFG annotation scheme) to a gold stan...
متن کاملCross-Lingual Induction for Deep Broad-Coverage Syntax: A Case Study on German Participles
This paper is a case study on cross-lingual induction of lexical resources for deep, broad-coverage syntactic analysis of German. We use a parallel corpus to induce a classifier for German participles which can predict their syntactic category. By means of this classifier, we induce a resource of adverbial participles from a huge monolingual corpus of German. We integrate the resource into a Ge...
متن کاملTreebank-Based Acquisition of Multilingual Unification Grammar Resources
Deep unification(constraint-)based grammars are usually hand-crafted. Scaling such grammars from fragments to unrestricted text is time-consuming and expensive. This problem can be exacerbated in multilingual broad-coverage grammar development scenarios. Cahill et al. (2002, 2004) and O’Donovan et al. (2004) present an automatic f-structure annotation-based methodology to acquire broad-coverage...
متن کاملDeveloping German Semantics on the basis of Parallel LFG Grammars
This paper reports on the development of a core semantics for German which was implemented on the basis of an English semantics that converts LFG f-structures to flat meaning representations in a Neo-Davidsonian style. Thanks to the parallel design of the broad-coverage LFG grammars written in the context of the ParGram project (Butt et al., 2002) and the general surface independence of LFG f-s...
متن کامل